(57) Abstract

A viral vector production system is provided which system comprises: (i) a viral genome comprising at least one first nucleotide 
sequence encoding a gene product capable of binding to and effecting the cleavage, directly or indirectly, of a second nucleotide sequence, 
or transcription product thereof, encoding a viral polypeptide required for the assembly of viral particles; (ii) a third nucleotide sequence 
encoding said viral polypeptide required for the assembly of the viral genome into viral particles, which third nucleotide sequence has a 
different nucleotide sequence to the second nucleotide sequence such that said third nucleotide sequence, or transcription product thereof, 
is resistant to cleavage directed by said gene product. The viral vector production system may be used to produce viral particles for use in 
treating or preventing viral infection.

ANTI-VIRAL VECTORS



PCT/GB99/00325 



Field of the Invention 

The present invention relates to novel viral vectors capable of delivering anti-viral
inhibitory RNA molecules to target cells. 

Background to the Invention 

The application of gene therapy to the treatment of AIDS and HIV infection has been
discussed widely (14). The types of therapeutic gene proposed usually fall into one of two
broad categories. In the first the gene encodes protein products that inhibit the virus in a
number of possible ways. One example of such a protein is the RevM10 derivative of the
HIV Rev protein (16). The RevM10 protein acts as a transdominant negative mutant and
so competitively inhibits Rev function in the virus. Like many of the protein-based
strategies, the RevM10 protein is a derivative of a native HIV protein. While this provides
the basis for the anti-HIV effect, it also has serious disadvantages. In particular, this type
of strategy demands that in the absence of the virus there is little or no expression of the
gene. Otherwise, healthy cells harbouring the gene become a target for the host cytotoxic
T lymphocyte (CTL) system, which recognises the foreign protein (17, 25). The second
broad category of therapeutic gene circumvents these CTL problems. The therapeutic gene
encodes inhibitory RNA molecules; RNA is not a target for CTL recognition. The RNA
molecules may be anti-sense RNA (15, 31), ribozymes (5) or competitive decoys (1).

Ribozymes are enzymatic RNA molecules which catalyse sequence-specific RNA
processing. The design and structure of ribozymes has been described extensively in the 
literature in recent years (3, 7, 31). Amongst the most powerful systems are those that 
deliver multitarget ribozymes that cleave RNA of the target virus at multiple sites (5, 21). 

In recent years a number of laboratories have developed retroviral vector systems based on
HIV (2, 4, 18, 19, 22-24, 27, 32, 35, 39, 43). In the context of anti-HIV gene therapy these
vectors have a number of advantages over the more conventional murine based vectors
such as murine leukaemia virus (MLV) vectors. Firstly, HIV vectors would target
precisely those cells that are susceptible to HIV infection (22, 23). Secondly, the HIV-based
vector would transduce cells such as macrophages that are normally refractory to
transduction by murine vectors (19, 20). Thirdly, the anti-HIV vector genome would be
propagated through the CD4+ cell population by any virus (HIV) that escaped the
therapeutic strategy (7). This is because the vector genome has the packaging signal that
will be recognised by the viral particle packaging system. These various attributes make
HIV vectors a powerful tool in the field of anti-HIV gene therapy.

A combination of the multitarget ribozyme and an HIV-based vector would be attractive as
a therapeutic strategy. However, until now this has not been possible. Vector particle
production takes place in producer cells which express the packaging components of the
particles and package the vector genome. The ribozymes that are designed to destroy the
viral RNA would therefore also interrupt the expression of the components of the HIV-based
vector system during vector production. The present invention aims to overcome
this problem.

Summary of the Invention 

It is therefore an object of the invention to provide a system and method for producing viral
particles, in particular HIV particles, which carry nucleotide constructs encoding inhibitory
RNA molecules such as ribozymes and/or antisense RNAs directed against a
corresponding virus, such as HIV, within a target cell, that overcomes the above-mentioned
problems. The system includes both a viral genome encoding the inhibitory RNA
molecules and nucleotide constructs encoding the components required for packaging the
viral genome in a producer cell. However, in contrast to the prior art, although the
packaging components have substantially the same amino acid sequence as the
corresponding components of the target virus, the inhibitory RNA molecules do not affect
production of the viral particles in the producer cells because the nucleotide sequences
encoding the packaging components used in the viral system have been modified to prevent
the inhibitory RNA molecules from effecting cleavage or degradation of the RNA transcripts
produced from the constructs. Such a viral particle may be used to treat viral infections, in

Accordingly the present invention provides a viral vector system comprising:
(i) a first nucleotide sequence encoding a gene product capable of binding to and
effecting the cleavage, directly or indirectly, of a second nucleotide sequence, or
transcription product thereof, encoding a viral polypeptide required for the assembly of
viral particles; and

(ii) a third nucleotide sequence encoding said viral polypeptide required for the
assembly of viral particles, which third nucleotide sequence has a different nucleotide
sequence to the second nucleotide sequence such that the third nucleotide sequence, or
transcription product thereof, is resistant to cleavage directed by said gene product.

In another aspect, the present invention provides a viral vector production system 
comprising: 

(i) a viral genome comprising at least one first nucleotide sequence encoding a gene
product capable of binding to and effecting the cleavage, directly or indirectly, of a second
nucleotide sequence, or transcription product thereof, encoding a viral polypeptide required
for the assembly of viral particles;

(ii) a third nucleotide sequence encoding said viral polypeptide required for the
assembly of the viral genome into viral particles, which third nucleotide sequence has a
different nucleotide sequence to the second nucleotide sequence such that said third
nucleotide sequence, or transcription product thereof, is resistant to cleavage directed by
said gene product.

The gene product is typically an RNA inhibitory sequence selected from a ribozyme and an
anti-sense ribonucleic acid, preferably a ribozyme.

Preferably, the viral vector is a retroviral vector, more preferably a lentiviral vector, such as
an HIV vector. The second nucleotide sequence and the third nucleotide sequence are
typically from the same viral species, more preferably from the same viral strain.
Generally, the viral genome is also from the same viral species, more preferably from the
same viral strain.

In the case of retroviral vectors, the polypeptide required for the assembly of viral particles
is selected from gag, pol and env proteins. Preferably at least the gag and pol sequences
are lentiviral sequences, more preferably HIV sequences. Alternatively, or in addition, the
env sequence is a lentiviral sequence, more preferably an HIV sequence.

In a preferred embodiment, the third nucleotide sequence is resistant to cleavage directed
by the gene product as a result of one or more conservative alterations in the nucleotide
sequence which remove cleavage sites recognised by the at least one gene product and/or
binding sites for the at least one gene product. For example, where the gene product is a
ribozyme, the third nucleotide sequence is adapted to be resistant to cleavage by the
ribozyme.
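
As an illustration of how such alterations can leave the encoded polypeptide unchanged, the sketch below swaps each codon for a synonymous codon of the standard genetic code, choosing the one that differs from the original at the most positions. This is a minimal sketch and not the procedure used to make the constructs described here: reading "conservative" as "synonymous" is an assumption, the function names are hypothetical, and the short test sequence is arbitrary. Methionine and tryptophan codons have no synonyms and are left as they are.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order.
BASES = "UCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(rna):
    """Translate an in-frame RNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, len(rna), 3))


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def recode(rna):
    """Replace every codon by the synonymous codon least like the original."""
    if len(rna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    out = []
    for i in range(0, len(rna), 3):
        codon = rna[i:i + 3]
        synonyms = [c for c, aa in CODON_TABLE.items() if aa == CODON_TABLE[codon]]
        out.append(max(synonyms, key=lambda c: mismatches(c, codon)))
    return "".join(out)


original = "AUGGGUGCGAGAGCGUCA"  # arbitrary in-frame example sequence
altered = recode(original)
assert translate(altered) == translate(original)  # protein is unchanged
print(altered, mismatches(original, altered))
```

Every mismatch introduced in this way weakens base pairing between an inhibitory RNA designed against the original sequence and the altered transcript, while the translated product stays the same.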

Preferably the third nucleotide sequence is codon optimised for expression in host cells.
The host cells, which term includes producer cells and packaging cells, are typically
mammalian cells.

In a particularly preferred embodiment, (i) the viral genome is an HIV genome comprising
nucleotide sequences encoding anti-HIV ribozymes and/or anti-HIV antisense sequences
directed against HIV packaging component sequences (such as gag.pol) in a target HIV
and (ii) the viral system for producing packaged HIV particles further comprises nucleotide
constructs encoding the same packaging components (such as gag.pol proteins) as in the
target HIV wherein the sequence of the nucleotide constructs is different from that found in
the target HIV so that the anti-HIV ribozyme and/or antisense HIV sequences cannot effect
cleavage or degradation of the gag.pol transcripts during production of the HIV particles in
producer cells.

The present invention also provides a viral particle comprising a viral vector according to
the present invention and one or more polypeptides encoded by the third nucleotide
sequences according to the present invention. For example the present invention provides
a viral particle produced using the viral vector production system of the invention.

In another aspect, the present invention provides a method for producing a viral particle
which method comprises introducing into a host cell (i) a viral genome vector according to
the present invention; (ii) one or more third nucleotide sequences according to the present
invention; and (iii) nucleotide sequences encoding the other essential viral packaging
components not encoded by the one or more third nucleotide sequences.

The present invention further provides a viral particle produced by the method of the
invention.

The present invention also provides a pharmaceutical composition comprising a viral
particle according to the present invention together with a pharmaceutically acceptable 
carrier or diluent. 

The viral system of the invention or viral particles of the invention may be used to treat 
viral infections, particularly retroviral infections such as lentiviral infections including HIV
infections. Thus the present invention provides a method of treating a viral infection which 
method comprises administering to a human or animal patient suffering from the viral 
infection an effective amount of a viral system, viral particle or pharmaceutical 
composition of the present invention. 

The invention relates in particular to HIV-based vectors carrying anti-HIV ribozymes.
However, the invention can be applied to any other virus, in particular any other lentivirus,
for which treatment by gene therapy may be desirable. The invention is illustrated herein
for HIV, but this is not considered to limit the scope of the invention to HIV-based
anti-HIV vectors.

Detailed Description of the Invention 

The term "viral vector" refers to a nucleotide construct comprising a viral genome capable
of being transcribed in a host cell, which genome comprises sufficient viral genetic
information to allow packaging of the viral RNA genome, in the presence of packaging
components, into a viral particle capable of infecting a target cell. Infection of the target
cell includes reverse transcription and integration into the target cell genome, where
appropriate for particular viruses. The viral vector in use typically carries heterologous
coding sequences (nucleotides of interest) which are to be delivered by the vector to the
target cell, for example a first nucleotide sequence encoding a ribozyme. A viral vector is
incapable of independent replication to produce infectious viral particles within the final
target cell.

The term "viral vector system" is intended to mean a kit of parts which can be used when
combined with other necessary components for viral particle production to produce viral
particles in host cells. For example, the first nucleotide sequence may typically be present
in a plasmid vector construct suitable for cloning the first nucleotide sequence into a viral
genome vector construct. When combined in a kit with a third nucleotide sequence, which
will also typically be present in a separate plasmid vector construct, the resulting
combination of plasmid containing the first nucleotide sequence and plasmid containing
the third nucleotide sequence comprises the essential elements of the invention. Such a kit
may then be used by the skilled person in the production of suitable viral vector genome
constructs which when transfected into a host cell together with the plasmid containing the
third nucleotide sequence, and optionally nucleic acid constructs encoding other
components required for viral assembly, will lead to the production of infectious viral
particles.

Alternatively, the third nucleotide sequence may be stably present within a packaging cell 
line that is included in the kit. 

The kit may include the other components needed to produce viral particles, such as host
cells and other plasmids encoding essential viral polypeptides required for viral assembly.
By way of example, the kit may contain (i) a plasmid containing a first nucleotide sequence
encoding an anti-HIV ribozyme and (ii) a plasmid containing a third nucleotide sequence
encoding a modified HIV gag.pol construct which cannot be cleaved by the anti-HIV
ribozyme. Optional components would then be (a) an HIV viral genome construct with
suitable restriction enzyme recognition sites for cloning the first nucleotide sequence into
the viral genome; (b) a plasmid encoding a VSV-G env protein. Alternatively, nucleotide
sequences encoding viral polypeptides required for assembly of viral particles may be
provided in the kit as packaging cell lines comprising the nucleotide sequences, for
example a VSV-G expressing cell line.

The term "viral vector production system" refers to the viral vector system described above
wherein the first nucleotide sequence has already been inserted into a suitable viral vector
genome.

Viral vectors are typically retroviral vectors, in particular lentiviral vectors such as HIV
vectors. The retroviral vector of the present invention may be derived from or may be
derivable from any suitable retrovirus. A large number of different retroviruses have been
identified. Examples include: murine leukemia virus (MLV), human immunodeficiency
virus (HIV), simian immunodeficiency virus, human T-cell leukemia virus (HTLV),
equine infectious anaemia virus (EIAV), mouse mammary tumour virus (MMTV), Rous
sarcoma virus (RSV), Fujinami sarcoma virus (FuSV), Moloney murine leukemia virus
(Mo-MLV), FBR murine osteosarcoma virus (FBR MSV), Moloney murine sarcoma virus
(Mo-MSV), Abelson murine leukemia virus (A-MLV), Avian myelocytomatosis virus-29
(MC29), and Avian erythroblastosis virus (AEV). A detailed list of retroviruses may be
found in Coffin et al., 1997, "Retroviruses", Cold Spring Harbor Laboratory Press, Eds:
JM Coffin, SM Hughes, HE Varmus, pp 758-763.

Details on the genomic structure of some retroviruses may be found in the art. By way of 
example, details on HIV and Mo-MLV may be found from the NCBI Genbank (Genome
Accession Nos. AF033819 and AF033811, respectively). 

The lentivirus group can be split even further into "primate" and "non-primate". Examples
of primate lentiviruses include human immunodeficiency virus (HIV), the causative agent
of human acquired immunodeficiency syndrome (AIDS), and simian immunodeficiency virus
(SIV). The non-primate lentiviral group includes the prototype "slow virus" visna/maedi
virus (VMV), as well as the related caprine arthritis-encephalitis virus (CAEV), equine
infectious anaemia virus (EIAV) and the more recently described feline immunodeficiency
virus (FIV) and bovine immunodeficiency virus (BIV).

The basic structure of a retrovirus genome is a 5' LTR and a 3' LTR, between or within
which are located a packaging signal to enable the genome to be packaged, a primer
binding site, integration sites to enable integration into a host cell genome and gag, pol and
env genes encoding the packaging components - these are polypeptides required for the
assembly of viral particles. More complex retroviruses have additional features, such as
rev and RRE sequences in HIV, which enable the efficient export of RNA transcripts of the
integrated provirus from the nucleus to the cytoplasm of an infected target cell.

In the provirus, these genes are flanked at both ends by regions called long terminal repeats
(LTRs). The LTRs are responsible for proviral integration and transcription. LTRs also
serve as enhancer-promoter sequences and can control the expression of the viral genes.
Encapsidation of the retroviral RNAs occurs by virtue of a psi sequence located at the 5'
end of the viral genome.

The LTRs themselves are identical sequences that can be divided into three elements, 
which are called U3, R and U5. U3 is derived from the sequence unique to the 3' end of 
the RNA. R is derived from a sequence repeated at both ends of the RNA and U5 is 
derived from the sequence unique to the 5' end of the RNA. The sizes of the three
elements can vary considerably among different retroviruses.

In a defective retroviral vector genome gag, pol and env may be absent or not functional. 
The R regions at both ends of the RNA are repeated sequences. U5 and U3 represent 
unique sequences at the 5' and 3' ends of the RNA genome respectively. 

In a typical retroviral vector for use in gene therapy, at least part of one or more of the gag,
pol and env protein coding regions essential for replication may be removed from the virus.
This makes the retroviral vector replication-defective. The removed portions may even be
replaced by a nucleotide sequence of interest (NOI), such as a first nucleotide sequence of
the invention, to generate a virus capable of integrating its genome into a host genome but
wherein the modified viral genome is unable to propagate itself due to a lack of structural
proteins. When integrated in the host genome, expression of the NOI occurs - resulting in,
for example, a therapeutic and/or a diagnostic effect. Thus, the transfer of an NOI into a
site of interest is typically achieved by: integrating the NOI into the recombinant viral
vector; packaging the modified viral vector into a virion coat; and allowing transduction of
a site of interest - such as a targeted cell or a targeted cell population.

A minimal retroviral genome for use in the present invention will therefore comprise (5') R
- U5 - one or more first nucleotide sequences - U3 - R (3'). However, the plasmid vector
used to produce the retroviral genome within a host cell/packaging cell will also include
transcriptional regulatory control sequences operably linked to the retroviral genome to
direct transcription of the genome in a host cell/packaging cell. These regulatory
sequences may be the natural sequences associated with the transcribed retroviral sequence,
i.e. the 5' U3 region, or they may be a heterologous promoter such as another viral
promoter, for example the CMV promoter.

Some retroviral genomes require additional sequences for efficient virus production. For
example, in the case of HIV, rev and RRE sequences are preferably included. However the
requirement for rev and RRE can be reduced or eliminated by codon optimisation.

Once the retroviral vector genome is integrated into the genome of its target cell as proviral
DNA, the ribozyme sequences need to be expressed. In a retrovirus, the promoter is
located in the 5' LTR U3 region of the provirus. In retroviral vectors, the promoter driving
expression of a therapeutic gene may be the native retroviral promoter in the 5' U3 region,
or an alternative promoter engineered into the vector. The alternative promoter may
physically replace the 5' U3 promoter native to the retrovirus, or it may be incorporated at
a different place within the vector genome such as between the LTRs.

Thus, the first nucleotide sequence will also be operably linked to a transcriptional
regulatory control sequence to allow transcription of the first nucleotide sequence to occur
in the target cell. The control sequence will typically be active in mammalian cells. The
control sequence may, for example, be a viral promoter such as the natural viral promoter
or a CMV promoter or it may be a mammalian promoter. It is particularly preferred to use
a promoter that is preferentially active in a particular cell type or tissue type which the
virus to be treated primarily infects. Thus, in one embodiment, a tissue-specific regulatory
sequence may be used. The regulatory control sequences driving expression of the one or
more first nucleotide sequences may be constitutive or regulated promoters.

Replication-defective retroviral vectors are typically propagated, for example to prepare
suitable titres of the retroviral vector for subsequent transduction, by using a combination
of a packaging or helper cell line and the recombinant vector. That is to say, the three
packaging proteins can be provided in trans.



A "packaging cell line" contains one or more of the retroviral gag, pol and env genes. The
packaging cell line produces the proteins required for packaging retroviral DNA but it
cannot bring about encapsidation due to the lack of a psi region. However, when a
recombinant vector carrying an NOI and a psi region is introduced into the packaging cell
line, the helper proteins can package the psi-positive recombinant vector to produce the
recombinant virus stock. This virus stock can be used to transduce cells to introduce the
NOI into the genome of the target cells. It is preferred to use a psi packaging signal, called
psi plus, that contains additional sequences spanning from upstream of the splice donor to
downstream of the gag start codon (Bender et al. (46)) since this has been shown to
increase viral titres.

The recombinant virus whose genome lacks all genes required to make viral proteins can
transduce only once and cannot propagate. These viral vectors which are only capable of a
single round of transduction of target cells are known as replication defective vectors.
Hence, the NOI is introduced into the host/target cell genome without the generation of
potentially harmful retrovirus. A summary of the available packaging lines is presented in
Coffin et al., 1997 (ibid).



Retroviral packaging cell lines in which the gag, pol and env viral coding regions are
carried on separate expression plasmids that are independently transfected into a packaging
cell line are preferably used. This strategy, sometimes referred to as the three plasmid
transfection method (Soneoka et al. (33)), reduces the potential for production of a
replication-competent virus since three recombination events are required for wild type viral
production. As recombination is greatly facilitated by homology, reducing or eliminating
homology between the genomes of the vector and the helper can also be used to reduce the
problem of replication-competent helper virus production.

An alternative to stably transfected packaging cell lines is to use transiently transfected cell
lines. Transient transfections may advantageously be used to measure levels of vector
production when vectors are being developed. In this regard, transient transfection avoids
the longer time required to generate stable vector-producing cell lines and may also be used
if the vector or retroviral packaging components are toxic to cells. Components typically
used to generate retroviral vectors include a plasmid encoding the gag/pol proteins, a
plasmid encoding the env protein and a plasmid containing an NOI. Vector production
involves transient transfection of one or more of these components into cells containing the
other required components. If the vector encodes toxic genes or genes that interfere with
the replication of the host cell, such as inhibitors of the cell cycle or genes that induce
apoptosis, it may be difficult to generate stable vector-producing cell lines, but transient
transfection can be used to produce the vector before the cells die. Also, cell lines have
been developed using transient transfection that produce vector titre levels that are
comparable to the levels obtained from stable vector-producing cell lines (Pear et al. (47)).

Producer cells/packaging cells can be of any suitable cell type. Most commonly,
mammalian producer cells are used but other cells, such as insect cells, are not excluded.
Clearly, the producer cells will need to be capable of efficiently translating the env and
gag, pol mRNA. Many suitable producer/packaging cell lines are known in the art. The
skilled person is also capable of making suitable packaging cell lines by, for example,
stably introducing a nucleotide construct encoding a packaging component into a cell line.

As will be discussed below, where the retroviral genome encodes an inhibitory RNA
molecule capable of effecting the cleavage of gag, pol and/or env RNA transcripts, the
nucleotide sequences present in the packaging cell line, either integrated or carried on
plasmids, or in the transiently transfected producer cell line, which encode gag, pol and/or
env proteins will be modified so as to reduce or prevent binding of the inhibitory RNA
molecule(s). In this way, the inhibitory RNA molecule(s) will not prevent expression of
components in packaging cell lines that are essential for packaging of viral particles.

It is highly desirable to use high-titre virus preparations in both experimental and practical
applications. Techniques for increasing viral titre include using a psi plus packaging signal
as discussed above and concentration of viral stocks. In addition, the use of different
envelope proteins, such as the G protein from vesicular stomatitis virus, has improved titres
following concentration to 10^9 per ml (Cosset et al. (48)). However, typically the envelope
protein will be chosen such that the viral particle will preferentially infect cells that are
infected with the virus which it is desired to treat. For example where an HIV vector is being
used to treat HIV infection, the env protein used will be the HIV env protein.

Suitable first nucleotide sequences for use according to the present invention encode gene
products that result in the cleavage and/or enzymatic degradation of a target nucleotide
sequence, which will generally be a ribonucleotide. As particular examples, ribozymes
and antisense sequences may be mentioned.

Ribozymes are RNA enzymes which cleave RNA at specific sites. Ribozymes can be
engineered so as to be specific for any chosen sequence containing a ribozyme cleavage
site. Thus, ribozymes can be engineered which have chosen recognition sites in transcribed
viral sequences. By way of an example, ribozymes encoded by the first nucleotide
sequence recognise and cleave essential elements of viral genomes required for the
production of viral particles, such as packaging components. Thus, for retroviral genomes,
such essential elements include the gag, pol and env gene products. A suitable ribozyme
capable of recognising at least one of the gag, pol and env gene sequences, or more
typically, the RNA sequences transcribed from these genes, is able to bind to and cleave
such a sequence. This will reduce or prevent production of the gag, pol or env protein as
appropriate and thus reduce or prevent the production of retroviral particles.

Ribozymes come in several forms, including hammerhead, hairpin and hepatitis delta
antigenomic ribozymes. Preferred for use herein are hammerhead ribozymes, in part
because of their relatively small size, because the sequence requirements for their target
cleavage site are minimal and because they have been well characterised. The ribozymes
most commonly used in research at present are hammerhead and hairpin ribozymes.



Each individual ribozyme has a motif which recognises and binds to a recognition site in
the target RNA. This motif takes the form of one or more "binding arms", generally two
binding arms. The binding arms in hammerhead ribozymes are the flanking sequences
Helix I and Helix III, which flank Helix II. These can be of variable length, usually
between 6 and 10 nucleotides each, but can be shorter or longer. The length of the flanking
sequences can affect the rate of cleavage. For example, it has been found that reducing the
total number of nucleotides in the flanking sequences from 20 to 12 can increase the
turnover rate of the ribozyme cleaving an HIV sequence by 10-fold (44). A catalytic motif
in the ribozyme Helix II in hammerhead ribozymes cleaves the target RNA at a site which
is referred to as the cleavage site. Whether or not a ribozyme will cleave any given RNA is
determined by the presence or absence of a recognition site for the ribozyme containing an
appropriate cleavage site.



Each type of ribozyme recognises its own cleavage site. The hammerhead ribozyme
cleavage site has the nucleotide base triplet GUX directly upstream, where G is guanine, U
is uracil and X is any nucleotide base. Hairpin ribozymes have a cleavage site of
BCUGNYR, where B is any nucleotide base other than adenine, N is any nucleotide, Y is
cytosine or uracil and R is guanine or adenine. Cleavage by hairpin ribozymes takes
place between the G and the N in the cleavage site.
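
The two site definitions above can be expressed as simple pattern searches. The following Python sketch is illustrative only: the function names are hypothetical, the input is assumed to be a target sequence read 5' to 3', Y is taken as cytosine or uracil because the target is RNA, and the hammerhead cut is assumed to fall immediately 3' of the GUX triplet. A motif match is necessary but not sufficient for cleavage, since a ribozyme must also base-pair with the sequences flanking the site.

```python
import re

# Zero-width lookaheads so that overlapping motifs are all reported.
HAMMERHEAD = re.compile(r"(?=GU[ACGU])")               # GUX
HAIRPIN = re.compile(r"(?=[CGU]CUG[ACGU][CU][AG])")    # BCUGNYR


def _as_rna(seq):
    """Upper-case the sequence and read any T as U."""
    return seq.upper().replace("T", "U")


def hammerhead_sites(seq):
    """0-based index of the base immediately 3' of each GUX triplet."""
    return [m.start() + 3 for m in HAMMERHEAD.finditer(_as_rna(seq))]


def hairpin_sites(seq):
    """0-based index of N in each BCUGNYR motif (the cut is between G and N)."""
    return [m.start() + 4 for m in HAIRPIN.finditer(_as_rna(seq))]


print(hammerhead_sites("AAGUCAA"))
print(hairpin_sites("ACCUGAUG"))
```

In the first call the only GUX triplet is GUC at positions 2 to 4, so the reported position is 5. In the second call the motif CCUGAUG starts at position 1 and the cut lies before position 5.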



The nucleic acid sequences encoding the packaging components (the "third nucleotide
sequences") may be resistant to the ribozyme or ribozymes because they lack any cleavage
sites for the ribozyme or ribozymes. This prohibits enzymatic activity by the ribozyme or
ribozymes and therefore there is no effective recognition site for the ribozyme or
ribozymes. Alternatively or additionally, the potential recognition sites may be altered in
the flanking sequences which form the part of the recognition site to which the ribozyme
binds. This either eliminates binding of the ribozyme motif to the recognition site, or
reduces binding capability enough to destabilise any ribozyme-target complex and thus
reduce the specificity and catalytic activity of the ribozyme. Where the flanking sequences
only are altered, they are preferably altered such that catalytic activity of the ribozyme at
the altered target sequence is negligible and is effectively eliminated.

Preferably, a series of several anti-HIV ribozymes is employed in the invention (5, 7, 10, 
13, 21, 36, 38, 40). These can be any anti-HIV ribozymes but must include one or more 
which cleave the RNA that is required for the expression of gag, pol or env. Preferably, a 
plurality of ribozymes is employed, together capable of cleaving gag, pol and env RNA of 
the native retrovirus at a plurality of sites. Since HIV exists as a population of 
quasispecies, not all of the target sequences for the ribozymes will be included in all HIV 
variants. The problem presented by this variability can be overcome by using multiple 
ribozymes. Multiple ribozymes can be included in series in a single vector and can 
function independently when expressed as a single RNA sequence. A single RNA 
containing two or more ribozymes having different target recognition sites may be referred 
to as a multitarget ribozyme. The placement of ribozymes in series has been demonstrated 
to enhance cleavage. The use of a plurality of ribozymes is not limited to treating HIV 
infection but may be used in relation to other viruses, retroviruses or otherwise. 
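
The rationale for using multiple ribozymes against a variable population can be illustrated with a small calculation. The Python sketch below is a toy model under stated assumptions: the variant sequences and target sites are invented placeholders, not HIV sequences, and a variant is counted as covered if it contains an exact copy of at least one target site, which ignores any tolerance of partial mismatches.

```python
def covered_fraction(variants, target_sites):
    """Fraction of variant sequences containing at least one target site."""
    if not variants:
        return 0.0
    hits = sum(any(site in v for site in target_sites) for v in variants)
    return hits / len(variants)


# Hypothetical variants: two of them have lost one target site by mutation.
variants = [
    "AAGUCAUUGGGUAACC",  # carries both sites
    "AAGACAUUGGGUAACC",  # first site mutated
    "AAGUCAUUGGCUAACC",  # second site mutated
]
site_a, site_b = "AAGUCAUU", "GGGUAACC"

single = covered_fraction(variants, [site_a])          # one ribozyme
multi = covered_fraction(variants, [site_a, site_b])   # two ribozymes in series
```

In this toy population a single ribozyme reaches two of the three variants, whereas the pair reaches all three.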

Antisense technology is well known in the art. There are various mechanisms by which 
antisense sequences are believed to inhibit gene expression. One mechanism by which 
antisense sequences are believed to function is the recruitment of the cellular protein 
RNAseH to the target sequence/antisense construct heteroduplex, which results in cleavage 
and degradation of the heteroduplex. Thus the antisense construct, by contrast to 
ribozymes, can be said to lead indirectly to cleavage/degradation of the target sequence. 
Thus according to the present invention, a first nucleotide sequence may encode an 
antisense RNA that binds to either a gene encoding an essential/packaging component or 
the RNA transcribed from said gene such that expression of the gene is inhibited, for 
example as a result of RNAseH degradation of a resulting heteroduplex. It is not necessary 
for the antisense construct to encode the entire complementary sequence of the gene 
encoding an essential/packaging component - a portion may suffice. The skilled person 
will easily be able to determine how to design a suitable antisense construct. 
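
As a simple illustration of the complementarity involved, the Python sketch below derives an antisense RNA for a portion of a target transcript by taking the reverse complement. The function name and the example fragment are hypothetical, and practical antisense design would also weigh factors such as target accessibility that are not modelled here.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target_rna, start=0, end=None):
    """Return the antisense RNA (written 5'->3') for target_rna[start:end]."""
    fragment = target_rna.upper().replace("T", "U")[start:end]
    # Complement each base, then reverse so the result reads 5'->3'.
    return fragment.translate(COMPLEMENT)[::-1]
```

Passing `start` and `end` selects only a portion of the target, reflecting the point that the complete complementary sequence is not required.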


By contrast, the nucleic acid sequences encoding the essential/packaging components of 
the viral particles required for the assembly of viral particles in the host cells/producer 
cells/packaging cells (the third nucleotide sequences) are resistant to the inhibitory RNA 
molecules encoded by the first nucleotide sequence. For example, in the case of ribozymes, 
resistance is typically by virtue of alterations in the sequences which eliminate the 
ribozyme recognition sites. At the same time, the amino acid coding sequence for the 
essential/packaging components is retained so that the viral components encoded by the 
sequences remain the same, or at least sufficiently similar that the function of the 
essential/packaging components is not compromised. 

The term "viral polypeptide required for the assembly of viral particles" means a 
polypeptide normally encoded by the viral genome to be packaged into viral particles, in 
the absence of which the viral genome cannot be packaged. For example, in the context of 
retroviruses such polypeptides would include gag, pol and env. The terms "packaging 
component" and "essential component" are also included within this definition. 

In the case of antisense sequences, the third nucleotide sequence differs from the second 
nucleotide sequence encoding the target viral packaging component to 
the extent that although the antisense sequence can bind to the second nucleotide sequence, 
or transcript thereof, the antisense sequence cannot bind effectively to the third nucleotide 
sequence or RNA transcribed therefrom. The changes between the second and third 
nucleotide sequences will typically be conservative changes, although a small number of 
amino acid changes may be tolerated provided that, as described above, the function of the 
essential/packaging components is not significantly impaired. 

Preferably, in addition to eliminating the ribozyme recognition sites, the alterations to the 
coding sequences for the viral components improve the sequences for codon usage in the 
mammalian cells or other cells which are to act as the producer cells for retroviral vector 
particle production. This improvement in codon usage is referred to as "codon 
optimisation". Many viruses, including HIV and other lentiviruses, use a large number of 
rare codons and by changing these to correspond to commonly used mammalian codons, 
increased expression of the packaging components in mammalian producer cells can be 
achieved. Codon usage tables are known in the art for mammalian cells, as well as for a 
variety of other organisms. 

Thus preferably, the sequences encoding the packaging components are codon optimised. 
More preferably, the sequences are codon optimised in their entirety. Following codon 
optimisation, it is found that there are numerous sites in the wild type gag, pol and env 
sequences which can serve as ribozyme recognition sites and which are no longer present 
in the sequences encoding the packaging components. In an alternative but less practical 
strategy, the sequences encoding the packaging components can be altered by targeted 
conservative alterations so as to render them resistant to selected ribozymes capable of 
cleaving the wild type sequences. 
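
The effect of codon optimisation on candidate ribozyme sites can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to derive the synthetic sequences: the table of preferred codons is an assumption chosen for the example (a real design would take it from a codon usage table for the intended producer cells), the function names are hypothetical, and the test fragment is a short illustrative coding sequence.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built by enumerating codons in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: one illustrative preferred codon per amino acid.
PREFERRED = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
    "*": "TGA",
}


def translate(dna):
    """Translate complete codons; a trailing partial codon is ignored."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def codon_optimise(dna):
    """Swap every codon for the preferred synonymous codon."""
    dna = dna.upper()
    optimised = "".join(PREFERRED[aa] for aa in translate(dna))
    assert translate(optimised) == translate(dna)  # encoded protein is unchanged
    return optimised


def count_gtx(dna):
    """Count GTX triplets (GUX in the transcript), i.e. candidate hammerhead sites."""
    return sum(dna[i:i + 2] == "GT" for i in range(len(dna) - 2))
```

Comparing `count_gtx` before and after `codon_optimise` shows how synonymous changes alone can remove candidate sites while the encoded amino acid sequence stays the same.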


An additional advantage of codon optimising HIV packaging components is that this can 
increase gene expression. In particular, it can render gag, pol expression Rev-independent 
so that rev and RRE need not be included in the genome (11). Rev-independent vectors are 
therefore possible. This in turn enables the use of anti-rev or RRE factors in the retroviral 
vector. 

As described above, the packaging components for a retroviral vector include expression 
products of gag, pol and env genes. In accordance with the present invention, gag and pol 
employed in the packaging system are derived from the target retrovirus on which the 
vector genome is based. Thus, in the RNA transcript form, gag and pol would normally be 
cleavable by the ribozymes present in the vector genome. The env gene employed in the 
packaging system may be derived from a different virus, including other retroviruses such 
as MLV and non-retroviruses such as VSV (a rhabdovirus), in which case it may not need 
any sequence alteration to render it resistant to ribozyme cleavage. Alternatively, env may 
be derived from the same retrovirus as gag and pol, in which case any recognition sites for 
the ribozymes will need to be eliminated by sequence alteration. 

The process of producing a retroviral vector in which the envelope protein is not the native 
envelope of the retrovirus is known as "pseudotyping". Certain envelope proteins, such as 
MLV envelope protein and vesicular stomatitis virus G (VSV-G) protein, pseudotype 
retroviruses very well. Pseudotyping can be useful for altering the target cell range of the 
retrovirus. Alternatively, to maintain target cell specificity for target cells infected with the 
particular virus it is desired to treat, the envelope protein may be the same as that of the 
target virus, for example HIV. 

Other therapeutic coding sequences may be present along with the first nucleotide 
sequence or sequences. Other therapeutic coding sequences include, but are not limited to, 
sequences encoding cytokines, hormones, antibodies, immunoglobulin fusion proteins, 
enzymes, immune co-stimulatory molecules, anti-sense RNA, a transdominant negative 
mutant of a target protein, a toxin, a conditional toxin, an antigen, a single chain antibody, 
tumour suppressor protein and growth factors. When included, such coding sequences are 
operatively linked to a suitable promoter, which may be the promoter driving expression of 
the first nucleotide sequence or a different promoter or promoters. 

Thus the invention comprises two components. The first is a genome construct that will 
be packaged by viral packaging components and which carries a series of anti-viral 
inhibitory RNA molecules such as anti-HIV ribozymes (5, 7, 10, 13, 21, 36, 38, 40). These 
could be any anti-HIV ribozymes but the key issue for this invention is that some of them 
cleave RNA that is required for the expression of native or wild type HIV gag, pol or env 
coding sequences. The second component is the packaging system which comprises a 
cassette for the expression of HIV gag, pol and a cassette either for HIV env or an envelope 
gene encoding a pseudotyping envelope protein - the packaging system being resistant to the 
inhibitory RNA molecules. 



The viral particles of the present invention, and the viral vector system and methods used 
to produce them, may thus be used to treat or prevent viral infections, preferably retroviral 
infections, in particular lentiviral, especially HIV, infections. Specifically, the viral 
particles of the invention, typically produced using the viral vector system of the present 
invention, may be used to deliver inhibitory RNA molecules to a human or animal in need 
of treatment for a viral infection. 

Alternatively, or in addition, the viral production system may be used to transfect cells 
obtained from a patient ex vivo and then returned to the patient. Patient cells transfected ex 
vivo may be formulated as a pharmaceutical composition (see below) prior to 
readministration to the patient. 

Preferably the viral particles are combined with a pharmaceutically acceptable carrier or 
diluent to produce a pharmaceutical composition. Thus, the present invention also provides 
a pharmaceutical composition for treating an individual, wherein the composition 
comprises a therapeutically effective amount of the viral particle of the present invention, 
together with a pharmaceutically acceptable carrier, diluent, excipient or adjuvant. The 
pharmaceutical composition may be for human or animal usage. 

The choice of pharmaceutical carrier, excipient or diluent can be made with regard to the 
intended route of administration and standard pharmaceutical practice. Suitable carriers and 
diluents include isotonic saline solutions, for example phosphate-buffered saline. The 
pharmaceutical compositions may comprise as - or in addition to - the carrier, excipient or 
diluent any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), 
solubilising agent(s), and other carrier agents that may aid or increase the viral entry into 
the target site (such as for example a lipid delivery system). 

The pharmaceutical composition may be formulated for parenteral, intramuscular, 
intravenous, intracranial, subcutaneous, intraocular or transdermal administration. 


Where appropriate, the pharmaceutical compositions can be administered by any one or 
more of: inhalation, in the form of a suppository or pessary, topically in the form of a 
lotion, solution, cream, ointment or dusting powder, by use of a skin patch, orally in the 
form of tablets containing excipients such as starch or lactose, or in capsules or ovules 
either alone or in admixture with excipients, or in the form of elixirs, solutions or 
suspensions containing flavouring or colouring agents, or they can be injected parenterally, 
for example intracavernosally, intravenously, intramuscularly or subcutaneously. For 
parenteral administration, the compositions may be best used in the form of a sterile 
aqueous solution which may contain other substances, for example enough salts or 
monosaccharides to make the solution isotonic with blood. For buccal or sublingual 
administration the compositions may be administered in the form of tablets or lozenges 
which can be formulated in a conventional manner. 






The amount of virus administered is typically in the range of from 10 to 10' pfu, 
preferably from 10' to 10 s pfu, more preferably from 10'' to 10' pfu. When injected, 
typically 1-10 µl of virus in a pharmaceutically acceptable suitable carrier or diluent is 
administered. 

When the polynucleotide/vector is administered as a naked nucleic acid, the amount of 
nucleic acid administered is typically in the range of from 1 µg to 10 mg, preferably from 
100 µg to 1 mg. 

Where the first nucleotide sequence (or other therapeutic sequence) is under the control of 
an inducible regulatory sequence, it may only be necessary to induce gene expression for 
the duration of the treatment. Once the condition has been treated, the inducer is removed 
and expression of the NOI is stopped. This will clearly have clinical advantages. Such a 
system may, for example, involve administering the antibiotic tetracycline, to activate gene 
expression via its effect on the tet repressor/VP16 fusion protein. 

The invention will now be further described by way of Examples, which are meant to serve 
to assist one of ordinary skill in the art in carrying out the invention and are not intended in 
any way to limit the scope of the invention. The Examples refer to the Figures. In the 
Figures: 

Figure 1 shows schematically ribozymes inserted into four different HIV vectors; 

Figure 2 shows schematically how to create a suitable 3' LTR by PCR; 

Figure 3 shows the codon usage table for wild type HIV gag, pol of strain HXB2 (accession 
number: K03455). 

Figure 4 shows the codon usage table of the codon optimised sequence designated 
gagpol-SYNgp. 

Figure 5 shows the codon usage table of the wild type HIV env called env-mn. 

Figure 6 shows the codon usage table of the codon optimised sequence of HIV env 
designated SYNgp160mn. 

Figure 7 shows three plasmid constructs for use in the invention. 

Figure 8 shows the principle behind two systems for producing retroviral vector particles. 


The invention will now be further described in the Examples which follow, which are 
intended as an illustration only and do not limit the scope of the invention. 

EXAMPLES 


Example 1 - Construction of a Genome 

The HIV gag, pol sequence was codon optimised (Figure 4 and SEQ I.D. No. 2) and 
synthesised using overlapping oligos of around 40 nucleotides. This has three advantages. 
Firstly it allows an HIV based vector to carry ribozymes and other therapeutic factors. 
Secondly the codon optimisation generates a higher vector titre due to a higher level of 
gene expression. Thirdly gag, pol expression becomes rev independent which allows the 
use of anti-rev or RRE factors. 
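
One way to picture synthesis from overlapping oligos is to tile the designed sequence with oligos from both strands. The Python sketch below is an illustration under assumptions: the text states only that overlapping oligos of around 40 nucleotides were used, so the half-length offset between strands, the function names and the tiling scheme itself are hypothetical choices made for this example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def overlapping_oligos(sequence, length=40):
    """Tile a sequence with top-strand oligos and bottom-strand oligos.

    Bottom-strand oligos are offset by half an oligo length, so each one
    bridges the junction between two neighbouring top-strand oligos.
    """
    sequence = sequence.upper()
    offset = length // 2
    top = [sequence[i:i + length] for i in range(0, len(sequence), length)]
    bottom = [
        reverse_complement(sequence[i:i + length])
        for i in range(offset, len(sequence), length)
    ]
    return top, bottom
```

Joining the top-strand oligos in order reproduces the input sequence, which gives a quick consistency check on the tiling.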

Conserved sequences within gag, pol were identified by reference to the HIV Sequence 
database at Los Alamos National Laboratory (http://hiv-web.lanl.gov/) and used to design 
ribozymes. Because of the variability between subtypes of HIV-1 the ribozymes were 
designed to cleave the predominant subtype within North America, Latin America and the 
Caribbean, Europe, Japan and Australia; that is subtype B. The sites chosen were 
cross-referenced with the synthetic gagpol sequence to ensure that there was a low possibility of 
cutting the codon optimised gagpol mRNA. The ribozymes were designed with XhoI and 
SalI sites at the 5' and 3' end respectively. This allows the construction of separate and 
tandem ribozymes. 
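
The cross-referencing step can be approximated by asking how closely any stretch of the synthetic sequence resembles a chosen wild type target site. The Python sketch below is a simplified illustration: the function names are hypothetical, the mismatch threshold is an arbitrary assumption, and counting mismatches is only a rough proxy for whether a ribozyme would actually bind and cleave.

```python
def min_mismatches(site, sequence):
    """Smallest Hamming distance between `site` and any same-length window
    of `sequence` (both written in the same alphabet, e.g. DNA)."""
    site, sequence = site.upper(), sequence.upper()
    n = len(site)
    if len(sequence) < n:
        raise ValueError("sequence is shorter than the site")
    return min(
        sum(a != b for a, b in zip(site, sequence[i:i + n]))
        for i in range(len(sequence) - n + 1)
    )


def is_resistant(site, synthetic_sequence, min_required=3):
    # ASSUMPTION: the default threshold of 3 mismatches is illustrative only.
    return min_mismatches(site, synthetic_sequence) >= min_required
```

A site that still occurs exactly in the synthetic sequence gives a distance of zero and would be flagged for further alteration.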

The ribozymes are hammerhead (25) structures of the following general structure: 

    Helix I         Helix II         Helix III 

5'- NNNNNNNN CUGAUGAGGCCGAAAGGCCGAA NNNNNNNN 

The catalytic domain of the ribozyme (Helix II) can tolerate some changes without 
reducing catalytic turnover. 

The cleavage sites, targeting gag and pol with the essential GUX triplet (where X is any 
nucleotide base) are as follows: 



GAG 1    5' UAGUAAGAAUGUAUAGCCCUAC 
GAG 2    5' AACCCAGAUUGUAAGACUAUUU 
GAG 3    5' UGUUUCAAUUGUGGCAAAGAAG 
GAG 4    5' AAAAAGGGCUGUUGGAAAUGUG 
POL 1    5' ACGACCCCUCGUCACAAUAAAG 
POL 2    5' GGAAUUGGAGGUUUUAUCAAAG 
POL 3    5' AUAUUUUUCAGUUCCCUUAGAU 
POL 4    5' UGGAUGAUUUGUAUGUAGGAUC 
POL 5    5' CUUUGGAUGGGUUAUGAACUCC 
POL 6    5' CAGCUGGACUGUCAAUGACAUA 
POL 7    5' AACUUUCUAUGUAGAUGGGGCA 
POL 8    5' AAGGCCGCCUGUUGGUGGGCAG 
POL 9    5' UAAGACAGCAGUACAAAUGGCA 
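
Given a target site and the general structure shown earlier, the two flanking arms of a hammerhead ribozyme can be derived by complementarity. The Python sketch below is a hedged illustration rather than the design procedure actually used: it assumes 8-nucleotide arms, assumes that Helix III pairs with the target bases ending at the U of the GUX triplet while X is left unpaired, and assumes that Helix I pairs with the bases immediately 3' of X. The function names are hypothetical.

```python
# Helix II region taken from the general structure given in the text.
CATALYTIC_DOMAIN = "CUGAUGAGGCCGAAAGGCCGAA"
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    return rna.translate(COMPLEMENT)[::-1]


def design_hammerhead(target_site, triplet_start, arm=8):
    """Build a hammerhead ribozyme for the GUX triplet at `triplet_start`.

    ASSUMPTION: Helix III pairs with the `arm` target bases ending at the U
    of GUX, and Helix I pairs with the `arm` bases immediately 3' of X.
    """
    site = target_site.upper()
    if site[triplet_start:triplet_start + 2] != "GU":
        raise ValueError("no GUX triplet at the given position")
    upstream = site[max(0, triplet_start + 2 - arm):triplet_start + 2]
    downstream = site[triplet_start + 3:triplet_start + 3 + arm]
    helix_i = reverse_complement(downstream)    # 5' arm of the ribozyme
    helix_iii = reverse_complement(upstream)    # 3' arm of the ribozyme
    return helix_i + CATALYTIC_DOMAIN + helix_iii
```

For the 22-nucleotide sites listed above, the central GUX triplet begins at the eleventh nucleotide, so `triplet_start=10` would be passed for each of them.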



The ribozymes are inserted into four different HIV vectors (pH4 (10), pH6, pH4.1, or 
pH6.1) (Figure 1). In pH4 and pH6, transcription of the ribozymes is driven by an internal 
HCMV promoter (9). From pH4.1 and pH6.1, the ribozymes are expressed from the 5' 
LTR. The major difference between pH4 and pH6 (and pH4.1 and pH6.1) resides in the 3' 
LTR in the production plasmid. pH4 and pH4.1 have the HIV U3 in the 3' LTR. pH6 and 
pH6.1 have HCMV in the 3' LTR. The HCMV promoter replaces most of the U3 and will 
drive expression at high constitutive levels while the HIV-1 U3 will support a high level of 
expression only in the presence of Tat. 


The HCMV/HIV-1 hybrid 3' LTR is created by recombinant PCR with three PCR primers 
(Figure 2). The first round of PCR is performed with RIB1 and RIB2 using pH4 (12) as 
the template to amplify the HIV-1 HXB2 sequence 8900-9123. The second round of PCR 
makes the junction between the 5' end of the HIV-1 U3 and the HCMV promoter by 
amplifying the hybrid 5' LTR from pH4. The PCR product from the first PCR reaction and 
RIB3 serve as the 5' primer and 3' primer respectively. 

RIB1: 5'-CAGCTGCTCGAGCAGCTGAAGCTTGCATGC-3' 
RIB2: 5'-GTAAGTTATGTAACGGACGATATCTTGTCTTCTT-3' 
RIB3: 5'-CGCATAGTCGACGGGCCCGCCACTGCTAGAGATTTTC-3' 

The PCR product is then cut with SphI and SalI and inserted into pH4 thereby replacing the 
3' LTR. The resulting plasmid is designated pH6. To construct pH4.1 and pH6.1, the 
internal HCMV promoter (SpeI - XhoI) in pH4 and pH6 is replaced with the polycloning 
site of pBluescript II KS+ (Stratagene) (SpeI - XhoI). 

The ribozymes are inserted into the XhoI sites in the genome vector backbones. Any 
ribozymes in any configuration could be used in a similar way. 
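
A quick check of the cloning strategy is to confirm that the primers carry the recognition sequences of the enzymes used in the subsequent digests. The Python sketch below is illustrative: the primer sequences are those given above, the recognition sequences are the standard ones for XhoI, SalI and SphI, and the function and variable names are hypothetical.

```python
# Standard recognition sequences for the enzymes named in the text.
RECOGNITION_SITES = {
    "XhoI": "CTCGAG",
    "SalI": "GTCGAC",
    "SphI": "GCATGC",
}

PRIMERS = {
    "RIB1": "CAGCTGCTCGAGCAGCTGAAGCTTGCATGC",
    "RIB2": "GTAAGTTATGTAACGGACGATATCTTGTCTTCTT",
    "RIB3": "CGCATAGTCGACGGGCCCGCCACTGCTAGAGATTTTC",
}


def sites_in(sequence):
    """Map each enzyme name to the 0-based positions of its site in `sequence`."""
    found = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = sequence.find(site)
        while start != -1:
            positions.append(start)
            start = sequence.find(site, start + 1)
        if positions:
            found[enzyme] = positions
    return found


primer_sites = {name: sites_in(seq) for name, seq in PRIMERS.items()}
```

With these inputs, RIB1 is found to carry XhoI and SphI sites and RIB3 a SalI site, consistent with cutting the PCR product with SphI and SalI.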

Example 2 - Construction of a Packaging System 

The packaging system can take various forms. In a first form of packaging system, the 
HIV gag, pol components are co-expressed with the HIV env coding sequence. In this 
case, both the gag, pol and the env coding sequences are altered such that they are resistant 
to the anti-HIV ribozymes that are built into the genome. At the same time as altering the 
codon usage to achieve resistance, the codons can be chosen to match the usage pattern of 
the most highly expressed mammalian genes. This dramatically increases expression 
levels (28, 29) and so increases titre. A codon optimised HIV env coding sequence has 
been described by Haas et al (9). In the present example, a modified codon optimised HIV 
env sequence is used (SEQ I.D. No. 4). The corresponding env expression plasmid is 
designated pSYNgp160mn. The modified sequence contains extra motifs not used by Haas 
et al. The extra sequences were taken from the HIV env sequence of strain MN and codon 
optimised. Any similar modification of the nucleic acid sequence would function similarly 
as long as it used codons corresponding to abundant tRNAs (42) and led to resistance to 
the ribozymes in the genome. 



In one example of a gag, pol coding sequence with optimised codon usage, overlapping 
oligonucleotides are synthesised and then ligated together to produce the synthetic coding 
sequence. The sequences of a wild-type (Genbank accession no. K03455) and synthetic 
(gagpol-SYNgp) gagpol sequence are shown in SEQ I.D. Nos 1 and 2, respectively, and their 
codon usage is shown in Figures 3 and 4, respectively. The sequence of a wild type env 
coding sequence (Genbank Accession No. M17449) is given in SEQ I.D. No. 3, the 
sequence of a synthetic codon optimised sequence is given in SEQ I.D. No. 4 and their 
codon usage tables are given in Figures 5 and 6, respectively. As with the env coding 
sequence, any gag, pol sequence that achieves resistance to the ribozymes could be used. 
The synthetic sequence shown is designated gagpol-SYNgp and has an EcoRI site at the 5' 
end and a NotI site at the 3' end. It is inserted into pCIneo (Promega) to produce plasmid 
pSYNgp. 

In a second form of the packaging system a synthetic gag, pol cassette is coexpressed with 
a non-HIV envelope coding sequence that produces a surface protein that pseudotypes 
HIV. This could be for example VSV-G (20, 41), amphotropic MLV env (6, 34) or any 
other protein that would be incorporated into the HIV particle (37). This includes 
molecules capable of targeting the vector to specific tissues. Coding sequences for 
non-HIV envelope proteins are not cleaved by the ribozymes and so no sequence modification is 
required (although some sequence modification may be desirable for other reasons such as 
optimisation for codon usage in mammalian cells). 

Vector particles can be produced either from a transient three-plasmid transfection system 
similar to that described by Soneoka et al (33) or from producer cell lines similar to those 
used for other retroviral vectors (20, 35, 39). These principles are illustrated in Figures 7 
and 8. For example, by using pH6Rz, pSYNgp and pRV67 (VSV-G expression plasmid) 
in a three plasmid transfection of 293T cells (Figure 8), as described by Soneoka et al (33), 
vector particles designated H6Rz-VSV are produced. These transduce the H6Rz genome 
to CD4+ cells such as C8166 or Jurkat and produce the multitarget ribozymes. HIV 
replication in these cells is now severely restricted. 

All publications mentioned in the above specification are herein incorporated by reference. 
Various modifications and variations of the described methods and system of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly 
limited to such specific embodiments. Indeed, various modifications of the described 
modes for carrying out the invention which are obvious to those skilled in molecular 
biology or related fields are intended to be within the scope of the following claims. 

1. A viral vector system comprising: 

(i) a first nucleotide sequence encoding a gene product capable of binding to and 
effecting the cleavage, directly or indirectly, of a second nucleotide sequence, or 
transcription product thereof, encoding a viral polypeptide required for the assembly of 
viral particles; and 

(ii) a third nucleotide sequence encoding said viral polypeptide required for the 
assembly of viral particles, which third nucleotide sequence has a different nucleotide 
sequence to the second nucleotide sequence such that the third nucleotide sequence, or 
transcription product thereof, is resistant to cleavage directed by said gene product. 

2. A viral vector production system comprising: 

(i) a viral genome comprising at least one first nucleotide sequence encoding a gene 
product capable of binding to and effecting the cleavage, directly or indirectly, of a second 
nucleotide sequence, or transcription product thereof, encoding a viral polypeptide required 
for the assembly of viral particles; 

(ii) a third nucleotide sequence encoding said viral polypeptide required for the 
assembly of the viral genome into viral particles, which third nucleotide sequence has a 
different nucleotide sequence to the second nucleotide sequence such that said third 
nucleotide sequence, or transcription product thereof, is resistant to cleavage directed by 
said gene product. 

3. A system according to claim 1 or 2 wherein the gene product is selected from a 
ribozyme and an anti-sense ribonucleic acid. 

4. A system according to any one of claims 1 to 3 wherein the viral vector is a 
retroviral vector. 

5. A system according to claim 4 wherein the retroviral vector is a lentiviral vector. 

6. A system according to claim 5 wherein the lentiviral vector is an HIV vector. 

7. A system according to any one of claims 4 to 6 wherein the polypeptide required 
for the assembly of viral particles is selected from gag, pol and env proteins. 

8. A system according to claim 7 wherein at least the gag and pol proteins are from a 
lentivirus. 

9. A system according to claim 7 wherein the env protein is from a lentivirus. 

10. A system according to claim 8 or 9 wherein the lentivirus is HIV. 

11. A system according to any one of the preceding claims wherein the third nucleotide 
sequence is resistant to cleavage directed by the gene product as a result of one or more 
conservative alterations in the nucleotide sequence which remove cleavage sites recognised 
by the at least one gene product and/or binding sites for the at least one gene product. 

12. A system according to any one of claims 1 to 10 wherein the third nucleotide 
sequence is adapted to be resistant to cleavage by the at least one gene product. 

13. A system according to any one of the preceding claims wherein the third nucleotide 
sequence is codon optimised for expression in producer cells. 

14. A system according to claim 13, wherein the producer cells are mammalian cells. 

15. A system according to any one of the preceding claims comprising a plurality of 
first nucleotide sequences and third nucleotide sequences as defined therein. 

16. A viral particle comprising a viral vector genome as defined in any one of claims 2 
to 15 and one or more third nucleotide sequences as defined in any of claims 2 to 15. 

17. A viral particle produced using a viral vector production system according to any 
one of claims 2 to 15. 

18. A method for producing a viral particle which method comprises introducing into a 
host cell (i) a viral genome as defined in any one of claims 2 to 15, (ii) one or more third 
nucleotide sequences as defined in any of claims 2 to 15 and (iii) nucleotide sequences 
encoding the other essential viral packaging components not encoded by the one or more 
third nucleotide sequences. 

19. A viral particle produced by the method of claim 18. 

20. A pharmaceutical composition comprising a viral particle according to claims 16, 
17 or 19 together with a pharmaceutically acceptable carrier or diluent. 

21. A viral system according to any one of claims 1 to 16 or a viral particle according 
to claims 16, 17 or 19 in treating a viral infection. 

22. A viral system according to any one of claims 1 to 16 for use in a method of 
producing viral particles. 

Figure 3 

gagpol-SYNgp [1 to 4308] -> Codon Usage 

DNA sequence 4308 b.p. ATGGGCGCCCGC .... GATGAGGATTAG linear 
1436 codons 

MW : 161929 Dalton CAI(S.c) : 0.080 CAI(E.c) : 0.296 

SYNgp160mn -> Codon Usage 

2571 b.p. ATGAGGGTGAAG ... GCGCTGCTGTAA linear 

SEQUENCE LISTING PART OF THE DESCRIPTION 

SEQ ID. NO. 1 - Wild type gagpol sequence for strain HXB2 (accession no. K03455) 

A1GGGTGC2A G/«XGTCAGT ATTAAGCGGG Gi AGAATTAG ATCGATGGGA AAAAATTCGG * 0 
"[TA--GGXX GG jGAAAGAA ;aaA"ta:a;a T-iA'ACA'/ TAGTATGGjC AAGCAGGGAG GO 
C FAGAACGM Ti. XAGTTAA TXTGGCCTG T"^AAACA t CAGAAGG7 FG 1AGACAAA1A i AO 
GTGGGACAX TAG AAC CATC CCTTCAGACA GbMCAGAAG A AC TT AG A FC AXAFATA'T A-'A 
AGAGTAGCAA 0 7FGTATTG 7GTGGAXAA AGG A7 AG AG A 1 AAAAGAG AC CA-GGAAGGT -n: 
TTAGACAAGA TAGAGGAAGA GCAAAACAAA AuFAAGAAAA AAGCACAGG A AGGAGCAGAF LJf_-0 
GAlACAGGA.; AFAGCAATCA GGTCAGCGAA AATTACCCTA TAGTGCAGAA CAI'CCAGGGG 4GC 
CAAATGGTAi; AXAGGCCAT ATCACCTAGA AG TT 1AAA.TG r.'ATGGGTAAA AGTAGFAGAA -lfO 
GAGAAGGCTF T< AGC'XAGA AGTGATACCC AT GTT TIT AG CA1TATCAGA AGGAGXAGG 01 ( 
<; GACAAGATF TAAACACCAT GGTAAACACA G'lGGGGGGAG ATC AAGC AGG CATGCAAATG MM. 
TTAAAAGAGA CC^TCAATGA GGAAGCTGCA GAATGGGATA GAGTGCATCG AGTGCATGCA 660 
GGGCCTA1 FG CAOIAGGCCA GATGAGAGAA X AAGGGGAA GTGACA7AGG AGGAAITACT 7 AC 
AGIACCC iF-: AgGAACAAAT AGGATGGATG Ai AAATAAT-: i ACCTATXC AGFAGGAGAA ?:••:(■ 
AT1TATAAAA GATGGATAAT CCTGGGA1TA AATAAAATAG; TAAGAA.TGTA TAGCG7TAX Air 
AGCATTCTGG A'.ATAAGACA AGGACCAAAG GAAG G C TIT A. GAGACTATGT AGACGGGTK 9iKi 
TATAAAACTC TAAGAGCCGA GCAAGCTTC A GAGGAGGTAA AAAATTGGAT GAC AGAAACG %(■ 
1TGTTGGTCL AAAATGCGAA CCCAGATTGT AAGAQATTT 'iAAAAGCATT GGGAGCAGCG 1020 
GC 1 AC ACT AG MGAAATGAT GACAGCATGT CAGGGAG1AG GAGGACCCGG CCATAAGGCA 1U80 
AGAGTTTTGG CIGAAGCAAT GAGCCAAGTA ACAAATTCAG CTACCATAAT GATGC AGAGA 1140 
GGCAATTTTA GGAACCAAAG AAAGATTGTT AAGTGTTTCA ATTGTGGCAA AGAAGGGCAC 1200 
ACAGCCAGAA ATFGCAGGGC CCCTAGGAAA AAGGGC TGTT GGAAATGTGG AAAGGAAGGA 1260 
CACCAAATGA AAGATTGTAC TGAGAGACAG GCTAATTTTT 1AGGGAAGAT CTGGCCTTCC 1320 
T AC AAGGGAA GGCCAGGGAA TTTTCTTCAG AbCAGACCAG AGCCAACAGC CCCACCAGAA 1380 
GAGAGCTTCA GGTC TGGGGT AGAGACAACA ACTCCCCCTC AGAAGCAGGA GCCGATAGAC 1440 
AAGGAACTGT ATCCTTTAAC TTCCCTC AGG TC.ACTCTTTG GCAACGACCC CTCGTCACAA iSOO 
TAAAGATAGG GGGGCAACTA AAGGAAGCTC TATTAGATAC AGG AG C AG AT GATACAGTAT 1560 
TAGAAGAAAT GAGT1TGCCA GGAAGATGGA AA.CCAAAAAT GATAGGGGGA ATTGGAGGTT 1620 
TTATCAAAGT AAGACAGTAT GATCAGATAC TCATAGAAAT CTGTGGACAT AAAGCTATAG 1680 
GTACAGTATT AGTAGGACCT ACACCTGTCA ACATAATTGG AAGAAATCTG T1GACTCAGA 1740 
TTGGTTGCAC TTTAAATTTT CCCATTAGCC (TATTGAGAC TGTACCAGTA AAATTAAAGC 1800 
CAGGAATGGA TGGCCCAAAA GTTAAACAAT GGCCATTGAC AGAAGAAAAA ATAAAAGCAT 1860 
TAGTAGAAAT TTCJACAGAG ATGGAAAAGG AAGG'GAAAAT TTCAAAAATT GGGCCTGAAA 1AX 
ATCCATAC AA TAC TCCAGTA .TTGCCAIAA AGAAAAAAGA AGTACTAAA TGGAGAAAAT l'"-)80 
TAGTAGA1TT CAGAGAACTT AATAAGAGAA CTAAGAC7P ("TGGGAACJT CAVHAGGAA A040 
TACCACA1CC Ci ;C AGGGTTA AAAAAGAAAA AATCAGTAAC AGTAC 7GGA1 G 1 GGGTGATG 7U)0 
GATATTTTT C AGHTCCTTA GATGAAGACT T" AGGAAGTA IACTGCA1TT A( CATACCTA 2iti0 
GTATAAACAA TGAGACACCA GGGATTAGAT AiCAG". ACAA TGTGCTTCCA CAGGGATGGA A220 
AAGGATCACC AGCAATATTC CAAAGTAGCA TiiACAAAAAT CTTAGAGCCT TT7AGAAAAC 2280 
AAAATCCAGA CATAGTTATC WCAATACA TuGATGATTT i.JATGTAGGA TCTGACTTAG 7340 
AAATAGGGCA GCATAGAACA AAAATAGAGG AX TGAGAGA ACATCTGTTG AGGTGGGGAC 2400 
TTACCACACC AGACAAAAAA CATCAGAAAG AACCTCCATT CCTTIGGATG GGTTATGAAC 2460 
TCCATCCTGA TAAA.TGGACA GTACAGCCTA TAGTGCTGCC AGAAAAAGAC AC/.TGGACTG 2520 
TCAATGACAT ACAGAAGTTA GTGGGGAAAT TGAATTGGGC AAGTCAGATT TACCCAGGGA 2580 
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HAAAGTAAG G-'AAIIATGI AAACTCi IT A CAGGAACCAA AGCACTAA:A GAAGTAAAA .r.:f) 
'"iGFAAi'ACA A-.AAGCAGAG H AGAAAGb < AuAAAA: AG AGAGATTCTA AAAGAAGCA . AMU 
a'^a.; GiAnATGAC lAATCAAAAG ACTTAATAGC AGAAATAi AG AAGCAGGG.G- AnD 

aaaaaatg gacatatcaa at fiat ( aag? agccatttaa aaak tgaaa acag^aaaa- .jam 

AJ-AAAGAAf G-MGGTGCC i" AG AG T AATG ATGTAAAAC A AITAACAGAG GCAGTGCAA'- . 'HMD 

^aataa-.ta; ag-aa-aa f a gtaatatggg gaaagaciCC i'aaai ftaaa ctgoaata. gam 
/.^aggaaa; aa^gaaaca agtggagag agiattggca ago:acctgg atto: igag t '-m 
haaahgi ta' 1aaat 1 a •' ! i a ag a aaitatggta ccagt1agag a aag aaca -ag 
"ia-giaggag; aa-/aa ; a wgtagaig gggcagctaa cagggagact aaamagaa .mad 
/-a.: a.-vgai a ta am1aat aav.i aaga aaaaagttgt ca0a1aaa gacamaaca- a;a) 

A; AG AAGA'.! TwAAIAlAA GAATTTAA TAGATITGC A GGATI GGGGA TTAGAAGTAA A'4i) 
ACAfAGlAAC AgA: TCACAA JATGCATTAG GAATCATTCA AGCACAAGGA GATCAAAGTu 3 Ah.) 
AA1AAGAGT! AG KAATCAA AT.AAf AGAG( AGTTAATAAA AAAGG.AAAAG GTC1ATCTGG Attn 
i ATGGGTAO! A-.y. AC.Al'AAA GGAA1TGGAG GAAATGAACA AG TAG AT AAA TTAGKAGTo A^/Mj 
GTGGAA'ff AG GAAAGIACTA TTTUAGATG GAATAGATAA GGCC( AAGAT GAACATGAGA 3480
AA1AT;ACAG T/V-TTGGAGA GlAATGGCTA GTGATTTTAA CCTGCCACCT GTAGTAGCAA 3540
AAgAAAIAG! A'.; CAGUGT (ATAAATGA AGCTAAAAGG AGAAGCCATG CATGGACA/v. 3600
! AGA'AGTAG A. AGGAAIA TGGlAAC TAT] ATTGTACAC A TTTAGAAGGA AAAGTTATC - " 3660
AA.IAAAGl T- A1GIAGCC /-oTGGATAT- TAGAAGCAGA AGTTATTOIA GCAGAAACAG 3720
GGC AGGAAAC AGi." ATATTTT C TTTTAAAA ! 1AGCAGGAAG ATGGCCAGTA AAAACAA1AG 3780
AFACTGACAA iGu AGGAA" TTCACC&GTG GTACGGTTAG GGCCGCCTGT TGGT'GGGGGLi 3840 
bAATCAAGCA GGAATTTGGA A1TCCCTACA ATCCCCAAAG TC AAGGAGTA GTAGAATCTA 3900
■GAATAAAGA AHAAAGAAA ATTAIAGGAg AGGTAAGAGA TC AGGGTGAA CATC TT AAGA 3960
CAGCAGTACA AATGGCAGTA TTCATCCACA ATTTTAAAAG AAAAGGGGGG ATTGGGGGG'l 4020 
ACAGTGCAGG GGAAAGAATA GTAGACATAA TAGCAACAGA CATACAAACI AAAGAATTAv 4080 
AAAAACAAAT TACAAAAATT CAAAATTTTC GGGTTTATTA CAGGGACAGC AGAAATTCAC 4140 
TTTGGAAAGG ACCAGCAAAG C1CCTCTGGA AAGGTGAAGG GGCAGTAGTA ATACAAGATA 4200 
ATAGTGACAT AAAAGTAGTG r CAAGAAGAA AAGCAAAGAT CATTAGGGAT lATGGAAAA: 4260 
AGATGGCAGG TGATGATTGT GTGGCAAGTA GACAGGATGA GGATI AG 4307 



SEQ ID. NO. 2 - gagpol-SYNgp - codon optimised gagpol sequence

ATGGGCGCCC GCGCCAGCGT GCTGTCGGGl GGCGAGCTGG ACCGCTGGGA GAAGAfCCGC 60 
CTGCGCCCCG GCGGCAAAAA GAAGTACAAG CTGAAGCACA TCGTGTGGGC CAGCCGCGAA 120 
CTGGAGC.GCT TCGCCGTGAA CCCCGGGCTC CTGGAGACCA GCGAGGGGTG CCGCCAGATC 180 
(ATGGCCAAC TgCAGCCCAG CC r GCAAAC!" GGC.AGCGAGG AGCTGCGCAG CCTGTACAA'. 240 
AA'GIGGCCA OjU'GTACTG CGTCCACCAG CGCATCGAAA TCAAGGATAC GAAAGAGGA 300 
I TGGATAAAA Tf GAAGAGGA AC AGAATAAG AGC.AAAAAGA AGGCCCAACA GGCCGCCGCG 360 
GACACCGGAC A: AGCAACCA GGTC AGCCAG AACTACCCCA TCGTGCAGAA C ATCCAGGGG 420 
GAGAIGGTGC At lAGGCCAT CTCCCCCCGi. ACGCTGAACG CCTGGGTGAA GGTGGTGGAA 480 
GAGAAGGCTT T I AGCCCGGA GGTGATACCC ATGTTCTCAG CCCTGTCAGA GGGAGCCAOG 540 
CCCCAAGATC TG.AAC ACCAT GCTCAACACA GTGGGGGGAC ACCAGGCCGC CATGCAGATG 600 
C1GAAGGAGA C CATC AATG A GGAGGCTGCC GAATGGGATC GTGTGCATCC GGTGCACGCA 660 
GGGCCCATCG CACCGGGCCA GATGCGTGAG CCACGGGGCT CAGACATCGC CGGAACGAA" 720 
AGTACCCTTC AGGAACAGAT CGGCTGGATG ACCAACAACC CACCCATCCC GGTGGGAGAA 780 
AT( TACAAAC Gi TGGATC AT CUGGGCCTG AACAAGATCG TGCGCATGTA TAGCCCTACC 840 
AGCATfCTGG ACATCCGCCA AGGCCCGAAG GAACCCTTTC GCGACTACGT GGACCGGTTC 900 
tg.:g g!.gg- gcagggtagg caggagg:?,- agaagggg*^ gaogvac: 96 j
ctggiggf c agaagg :gaa l !xg gact gg aagaggai:: xaagggxt ggo:c:cagcg 1030 
gctagg;t'g agg—afgat ga-:; ^vtgt cagggagi ;g gt,g*::!Gg : :a:a/Gv.a iqao 

CGCGl'iGloG :] FGAGGGGAT GA^! GAGG TG ACGAAGI lGG CT'l !AK -A" GAIGGAG-GG li-'.O 

GGC.V* ! TTTC GGAA-" GA-GG f-V^AI !gI: AAG'iG !TT ! A A"GH;!-A AGAA^GGv 13--D 

AG AG ' ! . G-! A AgTgGAggG! L^iAGG.^A AA-jiGG-'TG! F GGv^r>!!G • A/- v,AAGG 1_V; 

lAGGAGATGA AAGA ! I'G'I AG IG-gAGA-G GC lA'TF :T i '-GGGA^GAT ■! TgG' '!^ I G 

[aga;oGG~a GoC-:aggg.aa t ftt-tfg-g ag:agaglag aggaag~gi -g-a'AlAi^a i.G'> 

GAGAg'TKA GG FG TGGGGT AGAGAGAAGA AATGGGaTG A^AgG-^A gGG; A IAg A- 1-40 

aaggaagIg- atogittaag t fga: kagA tg-:f-:tttg g. aaglagg >.v g(ag'- i-oo 

1AAAGAFAG3 GGGG!AG'! FG AAGGAGGC !G: "GGFGGAl AG GGAoGAg-] GA .A'0!GIG-' l.v-0 

TGGAgGAGAT GTCGFTGCGA GGG!GGFGgA AGCOGAAGAT gA ' GGGGGGA AFv ^GaG-GG F 7 lo/O 

] fATGAAGGT GiGCCAGTAT GAG! AG AT G! T'iATitiAAAF uGG »Y/j:. AAGG : ! T A FG i l'-Su 

G'lACGLFG'! T GGTGGGCCGG ACAOGOVTCA A 1 '! ATCATCGG A" G-!AAO!TG "■ FgACGCAGA 1 !•'•■: i; 

TCGGT FGCAC GG 1 GAAG T K OaATTAGO! C FAKGAGAl G-.jIAAGGGTu AAg'GGAAG' 18oc 

0!GGGATGGA GGGG!GGAAG GFCAAGGAAT GGCCAIT'GAG AGAGGAGAAG ATGAAGGC A= mi 

TGGTGGAGAF TTGG^CAGAG ATGGAAAAGG AAGGGAAAAT : ! T G! AAGATF GGG!!TGAGA I'HG'f "• 

AO! CgTACAA CACGlG GGTG I FCGCAA.TAA AGAAGAAGGA UOGAGGAAA TGGGjLAAG- 1 ,j 8c 

TGGTGGACTT G CGCGAGC FG AAlAAG(G'!A GGCAAgAlTT '! 'GGGAGG1 ' AGG TGGGG - GOAc 

TCCCGCACCC CGCAGGGCTG AAGAAGAAGA AA1CCGTGAG GG7ACTGGA' GTGGGjTGATm "IOC 

CCTAUTCIC CGTTG CCCTG GAGGAAGAGf TCAGGAAGIA G ACTGCCTTC AGAATGC C T ! Alt ••• 

cgat; aagaa cgagagaccg gggattcgat atcagfacaa. g!jTgc fggo! gagggctgga 222c 

AAGGC TCTCC CGCAAFCTTC CAGAGTAGGA TGACCAAAAF m.GTGGAGCu 7 FOGGCAAAl 2280
AGAACCCCGA CATCGTCATC TATCAGTACA TGGATGACTT GTAC GTGGGC TCTGATCTAG 2340 
AGATAGGGGA GCACCGCACC AAGATCGAGG AGCTGCGGCA GGACCTGTTG AGGTGGGGAC 2400 
TGACGAC ACC CGACAAGAAG CACCAGAAGG AGCC1CCGTT CCTCTGGATG GGTTACGAGC 2460 
TGCACCCTGA CAAATGGACC GTGGAGCCTA TCGTGCTGCC AGAGAAAGAC AGCTGGACTG 2520 
TCAACGACAT ACAGAAGCTG GTGGGGAAG'F TGAAGTGGGF CAGTCAGA1T 'I AGCCAGGGA 2580 
TTAAGSTGAG GCAGC FGTGC AAAC TCCTCC GCGGAACCAA GGC AGTCAC A GAGGTGATCG 2640
CUTAACCGA GGAGGCCGAG CTCGAAG1GG CAGAAAACCG AgAGATGCTA AAGGAGCCCG 2700
TGGAGGGCGT GTACTATGAC CCCTCCAAGG ACCTGATCGC CGAGATCCAG AAGCAGGGGG 2760 
AAGGCCAGTG GACCTATCAG ATTTACCAGG AGGGCTTGAA GAAGC IGAAG ACCGGCAAGT 2820 
ACGCCCGGAT GAGGGGTGCC CACACTAACG ACGTCAAGCA GC1GACCGAG GCCGTGCAGA 2880 
AGATCACCAC CGAAAGCATC GTGATCTGGG GAAAGACTCG TAAGTTCAAG CTGCCCATCC 2940
AGAAGGAAAC CTGGGAAACC TGGTGGACAG AGTATTGGCA GGCCACCTGG AHCCTGAGT 3000 
GGGAGTTCGT CAACACCCCT CCCCTGGTGA AGCTGTGGTA CC AGO TGG AG AAGGAGGCCA 3060 
TAGTGGGCGC CGAAACCTTC 1 ACGTGGAFG GGGCCGCTAA (AGGGAGACT AAGCTGGGCA 3120 
AAGCCGGATA CGTCACTAAC CGGGGCAGAt. AGAAGGTTCJ C AGCCTCAC ■ ( jAG A(X AC C A 3180
ACLAGAAGAC TGAGGTGGAG GCCATTTAGC TCGCTTTGC A GGAClCGGGt ( TGGAGGTGA 3240
ACATCGTGAC AGACTCTC.AG TATGCCC3GG GCATCATTCA AuGCCAGCCA GA 1 ! ("AG AG .G 3300
AGTCC GAGCT GGTCAATCAG ATCATCGAGl AGCTGATC AA GAACjGAAAAG GTGIATC GGo 3360 
CCTGGGTACC CGCCCACAAA GGCATTGC£G GCAAT(aAGCA G/FCGACAAij G1GGTCTCGG 3420 
CTGGCATGAG GAAGGTGGTA TTO'TGGATG GCATCGACM GGCCC AGGAG GAGCACGACiA 3480 
AATACCALAG CAACTGGCGG GCCATGGCTA GCGACTTCAA CGTGCCCCCT GTGGTG&'CA 3540
AAGAGATGGT GGCCAGCTGT GAC AAGTGTG AGCTCAAGGG CgAAGCCATi. CATGGCCAGG 3600 
TGGACTGTAG CCCCGGCATC TGGCAACTGG ATTGfACC(!A IGTGGAGGGg AAGGTTAUG 3660 
TGiTAGCCGT CCATGTGGCC AGTGGCTACA TCGAGGCCGA GGTUHCCo GCGGAAAi'AG 3720 
GGCAGGAGAC AGCCTACTTC CT( CTGAAGC TGGCAGGCCG GTGGCCAGTG AAGACGATGG 3780 
ATACIGACM FGGCAGCAA7 TTCACCAG'FG CTAC GGTTAA GGOXXTGG TGGTGGGGGG .38.10

GAATCAAGCA GGAGTTCGGG ATCCCCTAGA ATCCCGAGAG TCAGuGCGl 1 . G TCGAGiOFA l.^O 

1GAA1AAGGA G'TTAAAGAAG ATIATCGGOI AGGKAGAGA TC AG'GG TGAi^ GYTOT-CAALA Vm; 

CCG0GG7CCA AATGGCGGTA TTG-.TCCAlA ATTFCAAGCG GAAGGGGGGG Ai ii/Z^G-of 4«'V0 

A 1 AG^GCG..; GGAGLGGATC GTGGACATlA TCGlGACCGA -iATCLAGACT AAGGAGGFiG -irtO 

AAAAGCAGA1 TA'.CAAGATI CA^AATTI'lC GGGKTACfA CAGGGACAGl AGAAAKO; C -1 .40 

TCTGGAAAGl u CAGCGAAG lT'GTCTGGA AGGGTGAGGG GGGAGTAG7G ATC ■: AGG ATA A"O0 

AOV'y -WAf ; A/.G'.; IGG : ... C CC AGAAGAA AGGCGAAGAT AiTAbGG- i TAG.GG^AAC ■ K r -0 

A-:iA PSiGC i^'oG IoATGATIGl GTGGCGAGCA GACAGGATGA GGATTAG 4307



SEQ ID. NO. 3 - Envelope Gene from HIV-1 MN (Genbank accession no. M17449)

A IGAGAGTGA AGGGGATCAG GAGGAATTAT CAGCACTGGT GGGGATGGGG CACGATGC'IC 60
; 1 I'GGG [TAT TAATGATCFG TAGTGC TAGA GAAAAATTGT GGGTCACAG1 CTATTATGGG 120
GTACi T'GTGl GGAAAGAAGC AACCACCACT CTATTTTGTG GATCAGATG'C TAAAGCATAT 180
GATAGAGAGG 1 ACATAA1GT TTGGGC CAGA CAAGCC TGTG TAGCCACAGA CCCC AAGO! A 240
(AAGAAGTAG AATTGGTAAA TGTG AC AGAA AATTTTAACA TGTGGAAAAA. TAAC ATGGTA 300
GAAGAGATGl A1GAGGATAT AAFGAGTTTA TGGGATCAAA GCGFAAAG'G ATGTSTAAAA 360
Tl'AAC CCCAC TCTG1GTTAC TTTAAATFGC ACTGA1TTGA GGAAl'ACTAC TAATACCAAT 420
AATAGTACTG ( 1 AATAACAA TAGTAATAGC GAGGGAACAA TAAAGGGAGG AGAAATGAAA 480
AAC FGCTCT'l 1CAATATCAC CACAAGCATA AGAGATAAGA TGGAGAAAGA ATATGCACTT 540
CTTTATAAAG T1GAT ATAGT ATCAATAGAT AATGATAGTA GGAGCTATAG GTTGATAAGT 600
TGTAATACCT C.AGTC ATT AC ACAAGCTTGT CCAAAGATAT CCTTTGAGCC AATFCCCATA 660
CACTATTGT6 CCCCGGCTGG TTTTGCGATT CTAAAATGTA ACGATAAAAA GTTCAGTGGA 720 
AAAGGATCAT GTAAAAATGT CAGCACAGTA CAATGTACAC ATGGAATTAG GCCAG I'AGTA 780
TCAACTCAAC TGCTGTTAAA TGGCAGTCTA GCAGAAGAAG AGGTAGTAA1 TAGATCTGAG 840 
AATTTCACTG ATAATGCTAA AACCATCATA GFACATCTGA ATGAATCTGT ACAAATTAAT 900 
TGTACAAGAC CCAACTACAA TAA/\AGAAAA AGGATACATA TAGGACCAGG GAGAGCATTT 960 
TATACAACAA AAAATATAAT AGGAACTATA AGACAAGCAC ATTGTAACAT I AGFAGAGCA 1020 
AAATGGAATG ALACT TTAAG ACAGATAGTT AGCAAATTAA AAGAACAATT TAAGAATAAA 1080 
ACAATAGTCT TTAATCAATC CTCAGGAGGG GACCXAGAAA TTGTAATGCA CAGTTTTAAT 1140 
TGTGGAGGGG AATTTTTCTA CTGTAATACA TCACCACTGT TTAATAGTAC TTGGAATGGT 1200 
AATAATACTT GGAATAATAC TACAGGGTCA AATAACAATA TCACACTTCA ATGCAAAATA 1260
AAACAAATTA TAAACATGTG GCAGGAAGTA GGAAAAGCAA TGTATGCCCC TCCCATTGAA 1320 
GGACAAATTA GATGTTCATC AAATATTACA GGGCTACTAT TAACAAGAGA TGGTGGTAAG 1380 
GACACGGACA CGAACGACAC CGAGATCTTC AGACCTGGAG GAGGAGATAT GAGGGACAAT 1440 
TGGAGAAGTG AATTATATAA ATATAAAGTA GTAACAATTG AACCATTAGG AG1AGCACCC 1500 
ACCAAGGCAA AGAGAAGAGT GGTGCAGAGA GAAAAAAGAG CAGCGATAGG AGCTCTGTTC 1560 
CTIGGGTICT TAGGAGCAGC AGGAAGCACT ATGGGCGCAG CGTCAGTGAC GCTGACGGTA 1620 
CAGGCCAGAC TATTATTGTC TGGTATAGTG CAALAGCAGA ACAATTTGCT GAGGGCCATT 1680 
GAGGCGCAAG AGCATATGTT GCAACTCACA GTC1GGGGCA TCAAGCAGGT GCAGGCAAGA 1740 
GTCCTGGCTG TGGAAAGATA CCTAAAGGAT CAACAGCTCC TGGGGTTTTG GGGTTGCTCT 1800 
GGAAAACIGA TTTGCACCAC TACTGTGCCT 1GGAATGCTA GTTGGAGTAA TAAATCTCTG 1860 
GATGATATTT GGAATAACAT GACCTGGATG CAGTGGGAAA GAGAAATTGA CAATTACACA 1920 
AGCTTAATAT ACTCATTACT AGAAAAATCG CAAACCCAAC AAGAAAAGAA TGAACAAGAA 1980 
T1 ATTGGAAT TGGATAAATG GGCAAGTTTG TGGAATTGGT TTGACATAAC AAATTGGGTG 2040 
TGGTATATAA AAATATTCAT AATGATAGTA GGAGGCTTGG TAGGTTTAAG AATAGTTTTT 2100 
;atagtgaa fagagttaf



GATGT/V 

ggcfotag ? r'vj'jAn afofgaaagg a: f'aa.. aa 

AGAGAFAGAG ACACATFO^ TCFiA TTAGTG 2ATGGATI :T A'ACAATTA 



iAAbAA'.iA AbG I iAjAGAG 



:akgata:t caccaftgtc gttffagaca afo 

ygf.afcga: ffao 

CTGOjGAGCC TGT7FFTFTI AAaTACCA-A AAAAGAlAAT T-'-F.TC TTGA7 TGC'-GCGAGG A 340 

attgiggaa: hft^gaf] :a:^gggtg; ;aagt:-\t :.a a'iattggil -v-a iatc a: a :-m 

FAFA'TTGGA GT.:AA.GA'AT .AAA.AGIAGF GC7GTTAG::T f ATTAAFA: F ACA V. TA"; A ."■'■>,(; 
A 7 'GF AGGG^ACAGA 7AGGG1TA FA -vV-GTA 1 ' t, ; A/- AG. AAA F A 3 A' A' AAA* ' : . 'AC 

ftofafata-; c iac aagaa r aagac ag& v. ftggaaagGj .affafafa ■ . •;= :. 



SEQ ID. NO. 4 - SYNgp160mn - codon optimised env sequence

ATGAGGGTGA AGGGGATFFG FCGCAAF7AF 'i AFA AC1G : J G iGG'"TGGGG ■■.AOi.-Af GO' . A,; 

OIGG-sGCTfr! TGATGATOIG F AGCGCCAOF GAGAAGCTG7 GgGTGAFF GT o'-FTACGG' iA< 

GTGOCCGIG'I GGAAGGAGGF CACCACCAFl FTbTTCTGFG ./'AGCGAOLC CAAGGOGTAA 180 

GACAFCGAGG TGCACAAlGT GTGGGCOAFF C AGGCGTGOi TGCOFAOIGA Ff. FAACCCG l v l t' : 

LAGGAGGTGG AGOKG7GAA CGTGACCGAG AAOTCAAAA "VjiGGAAGAA FAACATGGTG AAA 

bAGG AGA7GG AT&AGGACA7 CATCAGCCIG TGGGACCAGA G AF7GAAGCC .'TGFGTGAAd AO 

FTGAFCCFCC TGIGlGTGAF FOIGAAFTjF AClGACCTGA GijAAFAFGAC FAAFAFCAAi. AAA 

AACAGCACCG CCAACAACAA FAGCAACAGF GAGGGCAFFA t. AAGGGCGG IGAGATGAAG 480

AACToCAGCT TCAAOVTOV: > AfCAGCATl CGCGACAAGA O.FAGAAGGA G" r ACGCO!TG 540 

FTGTACAAGC TGGAT AT G(G 4 GA&CATCGAF AACGAlAGFA iVAGCTACCG (" TbATCTC 600 

TGFAACACCA GCGTGA1 CAC CCAGGCCTGG CCLAAGATCA GlTCGAGCC FAIFCFCAli 660

OAFTACTGCG CCCCCGCCGG CnCGCCATC CTGAAGTGCA ACGACAAGAA GTTCAGCGGC 720

AAGGGCAGCT GCAAGAACGT GAGCACCGTG CAGTGCACCA AFGGCATCCG GC.CGGTGGTG 780 

AGCACCCAGC TCCTGCTGAA CGGCA&CCTG GCCGAGGAGG AGGTGGTGAT CCGCAGCGAG 840 

AACT7CACCG ACAACGCCAA GACCATCATF GTGCACCTGA A7GAGAGCGT GCAGATCAAF 900 

TGFACGCGTC CCAACTACAA CAA&CGCAAG CGCATCCAGA O GgCCCCGG GCGGGCCTTi" 960

TACACCACCA AGAACATCA1 CGGCACCATC CGCCAGGCCC AUGC AACAT FTF TAGAGCC 1020

AAG7GGAACG ACACCCTGCG CCAGATCGTG AGCAAGCTGA AGGAGCAGTT CAAGAACAAG 1080 

AC CATCGTGT TCAACCAGAG CAGCGGCGGC GACCCCGAGA FCGTGATGFA ;.AGF FTCAAC 1140 

TGFGGCGGCG AATTCTTCTA CTGCAACACC AGCCF CCTGT VAACAGCAC A IGGAACGGA 1200 

AACAACACCT GGAACAACAC lACCGGCAGC AACAACAATA IIACCCTCCA G"l GCAAGATC 1260 

AAGCAGATCA TCAACATGTG GCAGGAGGTG GGCAAGGCCA 7G7AFGCCCC CCCCATCGAG 1320

&GCCAGATCC GGTGCAGC AG CAACATCACf GGTCTGCTG' TGACCCGCGA CGGCGGCAAG 1380 

GACACCGACA CC.AACGACAC CGAAATCTTA CGCCCCGGCG GFGGFGAOAT GCGCGACAAC 1440 

7'GGAGATCTG AGCTGTACAA GTACAAGGTF; GTGAC GATFG AG' F f ' CTGGG CGTGGC ("GO. r"00 

AiCAAGGCCA AGCGCCGCGT GGTGCAGC GA GAGAAGCGGj A-,i; .0: ATCGG -AGO! CI GIT. I'r.O 

CTGGGCTTCC TGGGGFiCGGC GGGCAGCACF ATGGijGGCAG A'.A;aA^TGAF 'A' I GAF CGT'.. 1 iuAO 

AAGGCCCGCC TGCTCCTGAG CGGCATCGTG CAGCA&CAgA A ! .AA( FTF FT F 1 GAG( F AT- Il.MO 

GAGGCCCAGC AGCA1 ATGCT CCAGCTCACC GTGT&GGGC A AAGCAGCT CF AGGCC C(j ; 1-4D 

GTGCTGGCCG TGGAGCGCTA CCTGAAGGAi. GAGCAGCTCG I"GGGF TTCTG GuGfTGLl'O: 1800 

GGCAAGCTGA TCTGCACCAC CACGGTACCf TbGAACGCH AAiGFiAGCAA CAAGAGCC i-i 1860

GACGAGATCT GGAACAACAT GACC.TGGATG GAGTGGGAOF Gi GAGATCGA 'MC I ACA,i 1920

AGCCTGATCT AC AGCCTGCT GGAGAAGAGF CAGACCCA^A AGGAGAAGAA CGAGCAGGA.i 1980 

CTGCTGGAGC TGGACAAGTG GGCGAGCCTG TGGAACTGGT TFGAAATCAA F AA( TGGCTG 2040 

TGGTACATCA AAATCTTCAT CATGATTGTG GGCGGCCTGG TijGGCCTCCG AATCGTGTTG 2100

GCCGTGCTGA GCATCGTGAA CCGCGTGCGC CAGGGCTACA GCCCCCTGAG CrTCCAGAC.C 2160 
IGGCCCOT.G TCr C'^lGCGG GCGCGACCG 1 '; cccgagggca tcgaggagga GGGCGGCGAG 2220
CGCGAC CGCG AlAGOaGCGG l AGGCTCGTG CACGGCTTCC TGGCGATCAT LTGGGTCGAC 2280
CTGCGCAGGC TGTT:CTGTT CAGCTACCA: C AC CGCGAC C TGCTGCTGAT C GCCGGCCGC 2340
AT-JGTGGAAi: TCCTAGGCCG lCGCGGCTGG GAGGTGCTGA AGTACTGGTG GAACCFCCTC 2400
CAGTATTGGA GCCAGGAGCT GAAGKXAG' GCCGTGAGCC TGCTGAACGC CACCGCCATC 2460
GC2GTGGCCG A ^ A:> GA CCGCGTGATC GAGGTGCTCC AGAGGGCCGG GAGGGCGATC 2520
CTGCACATCC CCAiIG CGGAT CCGCCAGGGu CTCGAGAGGG CGGTGCTGTA A 2571 